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Technical Field 

The present invention Is in the field of immunoassays for HCV (previously called Non-A, Non-8 hepatitis 
vims). More particularly, ft concerns combinations of HCV antigens mat permit broad range immunoassays for 
5 anti-HCV antibodies. 

Background 

The disease known previously as Non-A, Non-B hepatitis (NANBH) was considered to be a transmissible 

10 disease or family of diseases that were believed to be viraMnduced. and mat were distinguishable from other 
forms of viral-associated liver diseases, including mat caused by the known hepatitis viruses, l.e.. hepatitis A 
vims (HAV), hepatitis B virus (HBV), and delta hepatitis vims (HDV), as well as the hepatitis induced by 
cytomegalovirus (CMV) or Epstein-Barr vims (EBV). NANBH was first identified In transfused individuals. 
Transmission from man to chimpanzee and serial passage In chimpanzees provided evidence that NANBH was 

16 due to a transmissible Infectious agent or agents. Epidemiologic evidence suggested that there may be three 
types of NANBH: a water-borne epidemic type; a blood-borne or parenterally transmitted type; and a sporadi- 
cally occurring (community acquired) type. However, unta recently, no transmissible agent responsible for 
NANBH had been identified, and clinical diagnosis and identification of NANBH had been accomplished prim- 
arty by exclusion of other viral markers. Among the methods used to detect putative NANBH antigens and anti- 

20 bodies were agar-gel diffusion, countertoimunoelectrophoresis, immunofluorescence microscopy, immune 
electron microscopy, radioimmunoassay, and enzyme-linked immunosorbent assay. However, none of these 
assays proved to be sufficiently sensitive, specific, and reproducible to be used as a diagnostic test for NANBH. 

In 1987, scientists at Chiron Corporation (the owner of the present application) identified the first nucleic 
acid definitively linked to Wood-borne NANBH. See. e.g., EPO Pub. No. 318,216; Houghton et a!., Science 

23 244:359 (1089). These publications describe the cloning of an Isolate from a new viral class, hepatitis C virus 
(HCV). the prototype isolate described therein being named 'HCV1.* HCV Is a FlavMtke virus, with an RNA 
genome. 

U.S. Patent Application Serial No. 456,637 (Houghton et at), incorporated herein by reference, describes 
me preparation of various recombinant HCV polypeptides by expressing HCV cDNA and the screening of those 

30 polypeptides for Immunological reactivity with sera from HCV patients. That limited screening showed that at 
least five of the polypeptides tested were very immunogenic; specifically, those identified as 5-1-1 , C100, C33c. 
CA279a, and CA290a. Of these five polypeptides, 5-1-1 is located in the putative NS4 domain; C100 spans 
the putative NS3 and NS4 domains; C33c is located within the putative NS3 domain and CA279a and CA290a 
are located within the putative C domain. The screening also showed that no single polypeptide tested was 

35 Immunol ogicaify reactive with all sera. Thus, unproved tests, which read with all or more samples from HCV 
positive individuals, are desirable. 

Disclosure of the Invention . 

40 Applicants have carried out additional serological studies on HCV antigens that confirm that no single HCV 
polypeptide identified to date is immunotogicaBy reactive with all sera. This lack of a single polypeptide that is 
universally reactive with all sera from Individuals with HCV may be due, inter ajia, to strain-to-strain variation 
in HCV epitopes, variability In ths burners! response from Indh^uaJ-to-mdivldual and/or variation in serology 
with the state of the disease. 

45 These additional studies have also enabled applicants to Identify combinations of HCV antigens that pro- 
vide more efficient detection of HCV antibodies than 8ny single HCV polypeptide. 

.Accordingly, one aspect of this invention is a combination of HCV antigens comprising: 

(a) a first HCV antigen from the C domain; and 

(b) at least one additional HCV antigen selected from the group consisting of 
so (0 an HCV antigen from the NS3 domain; 

(fi) an HCV antigen from the NS4 domain; 
(Hi) an HCV antigen from the S domain; end 
(rv) an HCV antigen from the NS5 domain. 
In one embodiment, the combination of HCV antigens is in the form of a fusion protein comprised of the 
55 antigens. In an alternative embodiment the combination of antigens Is In the form of the individual antigens 
bound to a common solid matrix. In still another embodiment the combination of antigens Is in the form of a 
mixture of the individual antigens. 

Another aspect of the Invention is a method for detecting a ntibodies to HCV in a mammalian body compo- 
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nent suspected of containing said sntft>odles comprising contacting said body component with the above-des- 
cribed combination of HCV antigens under conditions that perm ft antibody-antigen reaction and detecting the 
presence of Immune complexes of said antibodies and said antigens. 

Another aspect of the Invention is a method for detecting antibodies to HCV In a mammalian body compo- 
5 nent suspected of containing said antibodies comprising contacting said body component with a panel of HCV 
antigens, simultaneously or sequentially, comprising 

(a) a.first HCV antigen from the C domain; and 

(b) at feast one additional HCV antigen selected from the group consisting of 
(i) an HCV antigen from the NS3 domain: 

to (H) an HCV antigen from the KS4 domain; 

(iii) an HCV antigen from the S domain; and 

(iv) an HCV antigen from the NS5 domain under conditions that permit antibody-antigen reaction and 
detecting the presence of immune complexes of said antibodies and said antigens. 

Another aspect of the invention Is a kit for carrying out an assay for detecting antibodies to HCV In a manv 
f 5 maiian body component suspected of containing said entibod ies comprising In packaged combination 

(a) said combination of HCV antigens; 

(b) standard control reagents; and 

(c) instructions for carrying out the assay. 

20 Brief Description of the Drawings 
In the drawings: 

Figure 1 b the nucleotide sequence of the cDNA sense and anti-sense strand for the HCV pdyprotein and 
the amino acid sequence encoded by the sense strand. 
25 Rgure 2 is a schematic of the amino acid sequence of Figure 1 showing the putative domains of the HCV 
polypeptide. 

Modes for Carrying Out the Invention 
so Definitions 

"HCV antigen" intends a polypeptide of at least about 5 amino adds, more usually at least about 8 to 10 
amino acids that defines an epftope found in an isolate of HCV. Preferably, the epitope is unique to HCV. When 
an antigen is designated by an alphanumeric code, the epftope is from the HCV domain specified by the 
35 alphanumeric. 

"Synthetic* as used to characterize an HCV antigen intends that the HCV antigen has either been isolated 
from native sources or man-made such as by chemical or recombinant synthesis. 

■Domains' Intends those segments of the HCV polyprotein shown In Figure 2 which generally correspond 
to the putative structural and nonstructural proteins of HCV. Domain designations generally follow the com/en- 
40 tion used to name RavMraJ proteins. The locations of the domains shown In Figure 2 ere only approximate. 
The designations *NS* denotes "nonstructural" domains, while *S" denotes the envelope domain, and 'C* 
denotes the nudeocapskJ or core domain. 

"Fusion polypeptide" intends a polypeptide in which the HCV antigen(s) are part of a single continuous 
chain of amino acids, which chain does not occur in nature. The HCV antigens may be connected directly to 
45 each other by peptide bonds or be separated by intervening amino acid sequences. The fusion polypeptides 
may abo contain amino acid sequences exogenous to HCV. 

■Common solid matrix" Intends a solid body to which the individual HCV antigens or the fusion polypeptide 
comprised of HCV antigens are bound covalently or by rioncovalent means such as hydrophobic adsorption. 

"Mammalian body component' intends a fluid or tissue of a mammalian individual (e.g., a human) that com- 
so monly contains antibodies produced by the individual. Such components are known in the art and include, with- 
out limitation. Wood, plasma, serum, spinal fluid, lymph fluid, secretions of the respiratory, intestinal or 
genitourinary tracts, tears, saliva, mflk, white Wood cells, end myelomas. 

•Immunologically reactive* means that the antigen in question wtil react specifically with anti-HCV antibody 
commonly present in a significant proportion of sera from tndlvlduais infected with HCV. 
55 Immune complex* intends the combination or aggregate formed when an antibody binds to an epitope on 
an antigen. 
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Combinations of HCV Antigens 

Figure 2 shows the putative domains of the HCV polyprotein. "The domains from which the antigens used 
In the combinations derive are: C, S (or E). NS3, NS4, and NS5. The C domain is believed to define the nuc- 

5 leocapsid protein of HCV. It extends from the terminal of the poly protein to approximately amino add 120 of 
Figure 1. The S domain is believed to define the virion envelope protein, and possibly the matrix (M) protein, 
and Is believed to extend horn approximately amino acid 120 to amino acid 400 of Figure 1. The NS3 domain 
extends from approximately amino acid 1050 to amino acid 1 640 and is believed to constftute the viral protease. 
The NS4 domain extends from the terminus of NS3 to approximately amino acid 2000. The function of the NS4 

io protein is not known at this time. Finally, the NS5 domain extends from about amino add 2000 to the end of 
the potyprotein and Is believed to define the viral polymerase. 

The sequence shown In Figure 1 is the sequence of the HCV1 Isolate. It is expected that the sequences 
of other strains of the blood-borne HCV may differ from the sequence of Figure 1. particularly In the envelope 
(S) and nudeocapsid (C) domains. The use of HCV antigens having such differing sequences is Intended to 

is be within the scope of the present invention, provided, however, that the variation does not significantly degrade 
the Imrnundoglcal reactivity of the antigen to sera from persons Infeded with HCV. 

In general, the HCV antigens wai comprise entire or truncated domains, the domain fragments being readfly 
screened for antigenicity by those skilled in the art The individual HCV antigens used in the combination wO) 
preferably comprise the Immunodominant portion (Le., the portion primarily responsible for the immunological 

20 read'rvity of the polypeptide) of the stated domain. In the case of the C domain it Is preferred that the C domain 
antigen comprise a majority of the enure sequence of the domain. The antigen designated C22 (see Example 
4, Infra), 19 particularly preferred. The S domain antigen preferably includes the hydrophobic subdomain at the 
terminal end of the domain. This hydrophobic subdomain extends from approximately amino acid 199 to 
amino acid 328 of Figure 1. The HCV antigen designated $2 (see Example 3, infra), is particularly preferred. 

25 Sequence downstream of the hydrophobic subdomain may be included In the S domain antigen ff desired. 

A preferred NS3 domain antigen is the antigen designated C33c That antigen includes amino adds 1 192 
to 1457 of Figure 1. A preferred NS4 antigen is C100 which comprises amino adds 1569 to 1931 of Figure 1. 
A preferred NS5 antigen comprises amino acids 2054 to 2464 of Figure 1 . 

The HCV antigen may be in the form of a polypeptide composed entirely of HCV amino acid sequence or 

30 it may contain sequence exogenous to HCV (i.e.. it may be in the form of a fusion protein that includes exogen- 
ous sequence). In the case of recombinant produced HCV antigen, producing the antigen as a fusion protein 
such as with SOD. alpha-factor or ubiquitin (see commonly owned U.S. Pat No. 4,751,180, US. Pat No. 
4,870,008 and US. Pat. Application Serial. No. 390,599, filed 7 August 1 989, the disdosures of which are incor- 
porated herein, which describe expression of SOD, alpha-factor and ubiquitin fusion proteins) may Increase 

35 the level of expression and/ or Increase the water sdubfl it/ of the antigen. Fusion proteins such as the alpha* 
factor and ublquffln fusion are processed by the expression host to remove me heterologous sequence. Alpha- 
factor is a secretion system, however, while ubiquitin fusions remain in the cytoplasm. 

Further, the combination of antigens may be produced as a fusion protein. For instance, a continuous frag- 
ment of DNA encoding C22 and C33c may be constructed, doned into an expression vedor and used to express 

40 a fusion protein of C22 and C33c In a similar manner fusion proteins of C22 and C100; C22 and S2; C22 and 
an NS5 antigen; C22, C33c, and S2; C22, C100 and S2, and C22, C33c, C100, and S2 may be made. Alter- 
native fragments from the exemplified domain may also be used. 

Preparation of HCV Antigens 

45 

The HCV antigens of the Invention are preferably produced recombinant or by known solid phase chem- 
ical synthesis. They may, however, also be isdated from dissociated HCV or HCV partides using affinity 
chromatography techniques employing antibodies to the antigens. 

When produced by recombinant techniques, standard procedures for constructing DNA encoding the anti- 
so gen, donlng that DNA Into expression vectors, transforming host cells such as bacteria, yeast, insect or mam- 
malian ceils, and expressing such DNA to produce the antigen may be employed As indicated previously, it 
may be desirable to express the antigen as a fusion protein to enhance expression, facilitate purification, or 
enhance solubility. Examples of specific procedures for produdng representative HCV antigens ere described 
tnthe Examples, Infra, and in parent application Serial No. 456,637. 

55 

Formulation of Antigens for Use In Immunoassay 

The HCV antigens may be combined by produdng them In the form of a fusion protein composed of two 
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or more of the antigens, by immobilizing them Individually on a common solid matrix, or by physically mWng 
them. Fusion proteins of the antigen may also be immobilized on (bound to) 8 solid matrix. Methods and means 
tor covaJentfy or noncovalently binding proteins to solid matrices are known in the art The nature of the solid 
surface w3l vary depending upon the assay format For assays canted out In microliter wells, the sofld surface 

5 wQt be the wall of the well or cup. For assays using beads, the solid surface will be the surface of the bead. In 
assays using a dipstick (i.e., a solid body made from a porous or fibrous materia) such as fabric or paper) the 
surface w8J be the surface of the material from which the dipstick is made. In agglutination assays the solid 
surface may be the surface of latex or gelatin particles. When individual antigens are bound to the matrix they 
may be distributed homogeneously on the surface or distributed thereon in a pattern, such as bands so that a 

10 pattern of antigen binding may be discerned. 

Simple mixtures of the antigens comprise the antigens In 8ny suitable solvent or dispersing medium. 

Assay Formats Using Combinations of Antigens 

1 s The HCV antigens may be employed In virtually any assay format that employs a known antigen to detect 
antibodies. A common feature of ail of these assays is that the antigen is contacted with the body component 
suspected of containing HCV antibodies under conditions that permit the antigen to bind to any such antibody 
present In the component Such conditions wQi typically be physiologic temperature. pH and ionic strength using 
an excess of antigen. The incubation of the antigen with the specimen is followed by detection of Immune com- 

20 plexes comprised of the antigen. 

Design of the immunoassays Is subject to a great deal of variation, and many formats are known In the ait 
Protocols may, for example, use solid supports, or immunoprecipaatlon. Most assays involve the use of labeled 
antibody or polypeptide; the labels may be, for example, enzymatic, fluorescent chem3uminescent radio-ac- 
tive, or dye molecules. Assays which amplify the signals from the immune complex ere also known; examples 

25 of which are assays which utilize bfotin end avidin, and enzyme-labeled and mediated immunoassays, such 
as ELISA assays. 

The Immunoassay maybe, without limitation, In a heterogenous or in a homogeneous format and of a stan- 
dard or competitive type. In a heterogeneous format, the polypeptide is typically bound to a solid matrix or sup- 
port to fecflitate separation of the sample from the polypeptide after incubation. Examples of solid supports that 

30 can be used are nitrocellulose (e.g., In membrane or microliter well form), polyvinyl chloride (e.g„ in sheets or 
microliter wells), polystyrene latex (e.g.. In beads or microliter plates, polyvinylidine fluoride (known aslmmu- 
lon 1 *), diazotized paper, nylon membranes, activated beads, and Protein A beads. For example, Dynatech 
Immuton 1 * 1 or Immuion™ 2 microliter plates or 0.25 inch polysterene beads (Precision Plastic Bail) can be 
used In the heterogeneous format. The solid support containing the antigenic polypeptides Is typically washed 

35 after separating it from the test sample, and prior to detection of bound antibodies. Both standard and com- 
petitive formats are known In the art 

In a homogeneous format the test sample is incubated with the combination of antigens fin solution. For 
example, it may be under conditions that wfll precipitate any antigen-antibody complexes which are formed. 
Both standard and competitive formats for these assays are known in the art. 

40 In a standard format, the amount of HCV antibodies forming the antibody-antigen complex Is directly moni- 
tored. This may be accomplished by determining whether labeled anti-xenogenic (e.g., anti-human) antibodies 
which recognize en epitope on anti-HCV antibodies will bind due to complex formation. In a competitive format 
the amount of HCV antibodies bi the sample is deduced by monitoring the competitive effect on the binding of 
a known amount of labeled antibody (or other competing ligand) in the complex. 

45 Complexes formed comprising anti-HCV antibody (or, in the case of competetive assays, the amount of 
competing anttoody) are detected by any of a number of known techniques, depending on the format For 
example, unlabeled HCV antibodies In the complex may be detected using a conjugate of antixenogenefc Ig 
complexed with a label, (e.g„ an enzyme label). 

In an Immunopredpltetion or agglutination assay format the reaction between the HCV antigens and the 

so antibody forms a network mat precipitates from the solution or suspension and forms a visible layer or fHm of 
precipitate. If no anti-HCV aritiT)c<Jy Is present In the test spedman, no 

The HCV antigens will typically be packaged In the form of a kit for use in these immunoassays. The kit 
wSl normally contain In separate containers the combination of antigens (either already bound to a solid matrix 
or separata with reagents for binding them to the matrix), control antibody formulations (positive and/or nega- 

55 five), labeled antibody when the assay format requires same and 6ignaJ generating reagents (e.g.. enzyme sub- 
strate) If the label does not generate a signal directly. Instructions (e.g.. written, tape. VCR CD-ROM. etc.) for 
carrying out the assay usually will be included in the kit 

The following examples are intended to Illustrate the invention and are not intended to lfm& the Invention 
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In any manner. 

Example 1: Synthesis of HCV Antigen C33c 

5 HCV antigen C33c contains a sequence from the NS3 domain. Specifically, it includes amino acids 1 1 92- 
1457 of Figure 1. This antigen was produced In bacteria as a fusion protein with human superoxide cSsmutase 
(SOD) as follows. The vector pSOOcfl (Steiner et at (1986), J. Virol. KJ:9) was digested to completion with 
EooRl and BamHI and the resulting EcoRI.BamHI fragment was tigated to the following linker to form pcflEF: 

10 GATC CTG GAA TTC TGA TAA 

<3AC CTT AAG ACT ATT TTA A 

A cONA done encoding amino acids 1192-1457 and having EcoRI ends was Inserted Into pcflEF to form 

is pcf 1EF/C33c This expression construct was transformed Into D121 0 E.colt cells. 

The transformants were used to express a fusion protein comprised of SOD at the N-termlnus and In-frame 
C33c HCV antigen at the C-termirais. Expression was accomplished by inoculating 1 500 ml of Luria broth con- 
taining amptdllin (100 micrograms/ml) wfth 15 mJ of an overnight culture of the transformanb. The cells were 
grown to an OD. of 0.3, IPTG was added to yield a final concentration of 2 mM, and growth continued until the 

20 cells attained a density of 1 0.D., at which time they were harvested by contrifugaiion at 3,000 x g at 4°C for 
20 minutes. The packed ceOs can be stored at -80"C for several months. 

In order to purify the SOD-C33c polypeptide the bacterial celts In which the polypeptide was expressed 
were subjected to osmotic shock and mechanical disruption, the insoluble fraction containing SOD-C33c was 
isolated and subjected to differential extraction with an alkaline-Nad solution, and the fusion polypeptide in 

25 the extract purified by ctiromatography on columns of S-Sepharose end Q-Sepharose. 

The crude extract resulting from osmotic shock and mechanical disruption was prepared by the following 
procedure. One gram of the packed cells were suspended In 10 ml of a solution containing 0.02 M Trte HCt, 
pH 7.5, 10 mM EDTA, 20% sucrose, and incubated for 10 minutes on fee. The cells were then pelleted by cen- 
trifugation at 4,000 xg for 15 min at 4°C. After the supernatant was removed, the cell pellets were resuspended 

so m 1 0 ml of Buffer A1 {0.01 M Tris KCt, pH 7.5, 1 mM EDTA, 14 mM beta-mercaptoelhano! [BME]), and incubated 
on ice for 10 minutes. The cells were again pelleted at 4.000 x g for 15 minutes at 4*C. After removal of the 
dear supernatant (periplasmic fraction i). the cell pellets were resuspended in Buffer A1, incubated on Ice for 
10 minutes, and again centrftuged at 4,000 x g for 15 minutes at4°C. The dear supernatant (periplasmic fraction 
II) was removed, and the cell pellet resuspended In 5 ml of Buffer A2 (0.02 M Tris HCI, pH 7.5, 14 mM BME. 

as 1 mM EOTA, 1 mM PMSF). In order to disrupt the cells, Ihe suspension (5 ml) and 7.5 mt of Oyno-milt lead-free 
acid washed glass beads (0.10-O.1 5 mm diameter)(obtained from Glen-Mil Is, Inc.) were placed in a Falcon tube, 
and vortexed at top speed for two minutes, followed by cooling for at least 2 mln on Ice; the vcrtaxlng-cooiing 
procedure was repeated another four tones. After vortexing, the slurry was fltered through a sdntered glass 
funnel using low suction; the glass beads were washed two Umes wWi Buffer A2, and the filtrate and washes 

40 combined. 

The Insoluble fraction of the crude extract was collected by centrifugation at 20.000 x g for 1 5 mln at 4°C, 
washed twice with 10 ml Buffer A2, and resuspended In 5 ml of MILU-Q water. 

A fraction containing SOD-C33c was isolated from the insoluble material by adding to the suspension NaOH 
(2 M) and NaCI (2 M) to yield a final concentration of 20 mM each, vortexing the mixture for 1 minute, centrifuglng 

45 & 20,000 xgfor 20 mln at 4°C, and retaining the supernatant 

In order to purify SOD-C33c on S-Seph arose, the supernatant fraction was adjusted to a final concentration 
of 6M urea, 0.05M Trs HCI, pH 7.5. 14 mM BME. 1 mM EDTA. This fraction was then applied to a column of 
S-Sepharose Fast Row (1.5 x 10 cm) which had been equilibrated wfth Buffer B (0.05M Tris HQ. pH 7.5, 14 
mM BME. 1 mM EDTA). After application, the column was washed with two column volumes of Buffer B. The 

so flow through and wash fractions were collected. The flow rate of application and wash, was 1 ml/mi n; and col- 
lected fractions were 1 mi. In order to identify fractions containing SOD-C33c, aliquots of the fractions were 
analyzed by electrophoresis on 10% pdyacrylamide gels containing SDS followed by staining with Coomassle 
blue. The fractions are also analyzable by Western blots using en antibody directed against SOD. Fractions 
containing SOO-C33c were pooled. 

55 Further purification of SOD-C33C was on a Q-Sepharose column (1 5 x 5 cm) which was equilibrated with 
Buffer B. The pooled fractions containing SOD-C33C obtained from chromatography on S-Sepharose was 
applied to the ootumn. The column was then washed with Buffer B, and eluted with 60 ml of a gradient of 0.0 
to 0.4 M Nad in Buffer B. The flow rate for application, wash, and elution was 1 rni/min; collected fractions 
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were 1 ml. All fractions from the Q-Sepharose column were analyzed as described for (he S-Sepharose column. 
The peak of SOD-G33C eluted from the column at about 0.2 M NaCl. 

The SOD-C33c obtained from the Q-Sepharose column was greater than about 90% pure, as Ridged by 
analysis on the pdyacrylamide SDS gets ami tmmunobtot using a monodonat antibody directed against human 
5 SOD. 

Example Z Synthesis of HCV Antigen C100 

HCV antigen C100 contains sequences from the NS3 and NS4 domains. Specifically. S includes amino 
to acids 1 560-1931 of Figure 1. This antigen was produced In yeast A cDNA fragment of a 1270 bp encoding the 
above amino adds and heaving EcoRJ termini was prepared. 

The construction of a yeast expression vector in which this fragment was fused directly to the S. cefevfeiae 
ADH2/GAP promoter was accomplished by a protocol which included amplification of the C100 sequence using 
a PCR method, followed by ligation of the amplified sequence into a cloning vector. After dordng, the C100 
15 sequence was excised, and with a sequence which contained the ADH2/GAP promoter, was Rgated to a targe 
fragment of a yeast vector to yield s yeast expression vector. 

The PCR amplification of C100 was performed using as template the vector pS^SSc^^ which had been 
linearized by digestion with Sail. pS3-56, which Is a pBR322 derivative, contains an expression cassette which 
is comprised of the ADH2/GAPOH hybrid yeast promoter upstream of the human superoxide oTsmutase gene, 
20 and a downstream alpha factor transcription terminator. 

The digonudeotide primers used for the amplification were designed to facfllate cloning into the expres- 
sion vector, and to introduce a translation termination cod on. Specifically, novel 5'-HtndUI and 3'-Sall sites were 
generated with fre PCR oligonucleotides. The oflgonudectide containing the Sail site also encodes the double 
termination codons, TAAandTGA. The oligonucleotide containing the Hindlil site also contains an untranslated 
25 leader sequence derived from the pgap63 gene, situated immediately upstream of the AUG codon. The 
p EC0633APDH gene is described by Holland and Holland (1980) and by Kniskem et a!. (19B6). The PCR primer 
sequences used for the direct expression of ClOOm were: 

5 ' GAG TGC TCA AGC TTC AAA ACA AAA TGG CTC 
50 ACT TTC TAT CCC AGA CAA AGC AGA GT 3' 

and 

as 

5' GAG TGC TCG TCG ACT CAT TAG GGG GAA 
ACA TGG TTC CCC CGG GAG GCG AA 3'. 

40 Amplification by PCR, utSWng the primers, and template, was with a Cetus-Perkm-Gmer PCR kit and was 
performed accordmg to the manufacturer's directions. The PCR conditions were 29 cycles of 94°C fora minute. 
37X for 2 mimjtes, 1TC for 3 minutes; and the final Incubation was at 72°C for 10 minutes. The DNA can be 
stored at 4°C or -20°C overnight 

After amplification, the PCR products were digested with H in dill and Sail. The major product of 1 .1 kb was 

45 purified by electrophoresis on a gel, and the eluted purified product was Rgated with a large Sall-Hfodlll fragment 
of DBR322. In order to Isolate correct recombinants, competent HB101 calls were transformed with the recom- 
binant vectors, and after cloning, the desired recombinants were Identified on the basis of the predicted size 
of Hindi))- Sail fragments excised from the clones. One of the clones which contained the a Hindi ll-S all fragment 
of me correct size was named pBR322/C100"d. Confirmation that this clone contained amplffled C1 00 was by 

so direct sequence analysis of the Hlndllt-SaU fragment 

The expression vector containing C100 was constructed by Hosting the Hhdlll-Satl fragment from 
pBR322/C10CTd to a 13.1 kb BamKI^Sall fragment of pBS24.1, and a 1389 bm BamHl-Hindiir fragment con- 
taining the ADH2f GAP pomoter. (The latter fragment Is described In EPO 164,550). The pBS24.1 vector is 
described In commonly owned U.S.S.N. 382,805 filed 19 July 1989. The ADH2/GAP promoter fragment was 

55 obtained by digestion of the vector pPGAP/AG/HIndill with Hindlil and BamHl. followed by purification of the 
1369 bp fragment on a gel. 

Competent HB101 cells were transformed wfth the recombinant vectors; and correct recombinants were 
identified by the generation of a 2464 bp fragment and 8 13.1 kb fragment generated by BamHl and Sail dlges- 



7 




EP0 450 931A1 

ton of the cloned vectors. One of Jhe doned coned recombinant vectors was named pC100~d#3. 

In order to express C1 00, competent cells of Saccharomyces cerevjsae strain AB122 (MATa teu2 utb3-53 
prb 1-1 122 pep4-3 prc1-407[dr-0J) were transformed with the expression vector pC100~<W3. The transformed 
cells were plated on URA-sorbitol. and Individual transformants were then streaked on Letr plates. 

6 Individual clones were cultured In Leu - , unr medium with 2% glucose at 30°C for 24-36 hows. One (Iter 
of Yeast Extract Peptone Medium (YEP) containing 2% glucose was inoculated with 1 0 ml of the overnight cut- 
ture. end the resulting culture was grown at 30*C at an agitation rate of 400 rpm end an aeration rate of 1 L of 
air peril of medium per minute (l.e. t 1 win) for 48 hours. The pH of the medium was not controlled. The culture 
was grown In a BioRo II tormentor manufactured by New Brunswick Science Corp. Following ferrnentation, the 

to ceSs were isolated and analyzed for C100 expression. 

Analysis for expressed C100 polypeptide by the transformed cells was performed on total cell fysates and 
crude extracts prepared from single yeast colonies obtained from the Leu- plates. The ceB lysates and crude 
extracts were analyzed by electrophoresis on SDS polyacrylamlde gels, and by Western biota. The Western 
blots were probed with rabbit polyclonal ant&odies directed against the SO D-C 100 polypeptide expressed in 

T5 yeast The expected size of the C 1 00 polypeptide is 364 amino adds. By gel analysis the expressed polypeptide 
hasaMW r of3&9K. 

Both analytical methods demonstrated that the expressed C100 polypeptide was present in total cell 
lysates, but was absent from crude extracts. These results suggest that the expressed C1 00 polypeptide may 
be insoluble. 

Example 3: Expression of HCV Antigen S2 

HCV antigen S2 contains a sequence from the hydrophobic N4erminus of the S domain. It includes amino 
adds 19^328 of Figure 1. 

The protocol far the construction of the expression vector encoding the S2 polypeptide and for its expres- 
sion In yeast was analogous to that used for the expression of the C100 polypeptide, described In Example 2. 

The template for the PCR reaction was the vector pBR322/PH4a, which had been linearized by digestion 
wfth Mndlll. Pl14a Is a cONA done that encodes amino acids 199-328. 

The oligonucleotides used as primers for the amplification by PCR of the S2 encoding sequence were the 
fo Rowing. 

For the 5 '-region of the S2 sequence: 

5' GAG TGC TCA AGC TTC AAA ACA AAA TGG GGC TCT 
ACC ACG TCA CCA ATG ATT GCC CTA AC 3'; 

and 

for the 3'-region of the S2 sequence: 

5 'GAG TGC TCG TCG ACT CAT TAA GGG GAC CAG TTC 
ATC ATC ATA TCC CAT GCC AT 3 ' . 

The primer for the Region Introduces a HindlH site and an ATG start codon into the amplified product The 
primer for the 3*-reglon Introduces translation stop codons and a Sail site Into the amplified product 

The PCR conditions were 29 cycles of WO for e minute, 37°C for 2 minutes, 72°C for 3 minutes, and the 
final incubation was at 72°C for 10 minutes. 

The maki product of the PCR reaction was a 41 3 bp fragment which was gel purified. The purified fragment 
was llgated to the large fragment obtained from pBR322 digested with Hlndlll and Sail fragment, yielding the 
plasmkl pBR322/S2d 

Ligation of the 413 bp HindDt-Sali S2 fragment with the U6 kb BemHI-Hlndlll fragment containing the 
ADH2/GAP promoter, and with the large BamHl-Sall fragment of the yeast vector pBS24.1 yielded recombinant 
vectors, which were cloned. Correct recombinant vectors were identified by the presence of a 1 .77 kb fragment 
after digestion with BamHl and Salt An expression vector constructed from the amplffied sequence, and con- 
taining the sequence encoding S2 fused dkactfy to the ADH2/GAP promoter is Identified as pS2ri#9. 
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Example 4: Synthesis of HCV C Antigen 

HCV antigen C22 » from the C domain. It comprises amino adds 1-122 of Rgure 1. 
The protocol for the construction of the expression vector encoding the C polypeptide and for to expression 
6 In yeast was analogous to that used for the expression of the C100 polypeptide, described supra, except for 
the following. 

The template for the PCR reaction was pBR322/ Ag30a which had been linearized with HlndlH. Ag30 is a 
cDNA done that encodes ammo adds 1-122. The oligonucleotides used as primers for the amplification by PCR 
of the C encoding sequence were the following. 

ro 

For the 5' -region of the C sequence: 

5' GAG TGC AGC TTC AAA ACA AAA TGA GCA CGA 
ATC CTA AAC CTC AAA AAA AAA AC 3' , 

15 

and 

2o for the 3' -region of the C sequence: 

5 ' GAG TGC TCG TCG ACT CAT TAA CCC AAA TTG CGC 
GAC CTA CGC CGG GGG TCT GT 3' * 

25 The primerfor the ^-region introduces a HJndlil site into the amplffled product, and the primer for the 3'-reg!on 
Introduces translation stop codons and a Sail site. The PCR was run for 29 cycles of 94°C for a minute, 37°C 
. for 2 minutes, 72*C for 3 minutes, and the find incubation was at 72°C for 10 minutes. 

The major product of PCR amplication is a 381 bp polynucleotide. Ligation of this fragment with the Sa- 
tt-Hindlli large Sall-Hindlli fragment of pBR322 yielded the plasmld pBR322SCZ 

30 Ligation of the 381 bp Wndlll-Sail C coding fragment excised from pBR322/C2 with the 1.36 kb BamHJ- 
WndlH fragment containing the ADH2/GAP promoter, and with the large BamHI-Sall fragment of the yeast vec- 
tor pBS24.1 yielded recombinant vectors, which were cloned. Correct recombinant vectors were Identified by 
the presence of a 1.74 kb fragment after digestion with BamHI and Sdl. An expression vector constructed from 
the amplified sequence, and containing the sequence encoding C fosed rJrectly to the ADH2/GAP promoter Is 

35 Identified as pC22. 

Analysis for expressed C polypeptide by the transformed ceOs was performed on total ceil lysates and crude 
extracts prepared from single yeast colonies obtained from the Leu* plates. The cell lysates and crude extracts 
were analyzed by electrophoresis on SOS pdyacrytamide gets. The C polypeptide is expected to have 122 
amino adds and by get analysis the expressed polypeptide has a MW r of approximately 13.8 Kd. 

40 

Example 5: Synthesis of NS5 Polypeptide 

This polypeptide contains sequence from the N-termlnus of the NS5 domain Specifically It Includes amino 
adds 2054 to 2484 of Rgure 1 . The protocol for the contraction of the expression vector encoding the NSS 
45 polypeptide and for its expression were analogous to that used for the expression of C33c (see Example 1). 

Example 6; Radioimmunoassay (RIA) for Antibodies to HCV 

The HCV antigens of Examples 1-5 wore tested in an RIA formal for their ability to detect antibodies to 
53 HCV in the serum of Individuals dlnicaily diagnosed as having HCV (Non-A, Non-B) and In serum from Wood 
given by paid blood donors. 

The RIA was based upon me procedure of Tsu end Herzenberg <1 960) In SELECTED METHODS IN CEL- 
LULAR IMMUNOLOGY (W.H. Freeman & Co.), pp. 373-391. Generally, microtfter plates pmmulcn 2, 
Renvwawefl strips) are coated with purified HCV antigen. The coated plates are incubated with the serum sam- 
55 pies or appropriate controls. During Incubation, antibody, if present, is immunologically bound to the solid phase 
antigen. After removal of the unbound material and washing of the mlcrotter plates, complexes of human antl- 
body-NANBV antigen are detected by incubation with 125 i-Sabe!ed sheep anti-human immunoglobulin. Unbound 
labeled antibody Is removed by aspiration, and the plates are washed. The radioactivity in Individual wells is 
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determined; the amount of bound human antt-HCV antibody Is proportional to the radioactivity tn the well. 

Specfffcaily, one hundred microliter aJkjuots containing 0.1 to 05 micrograms of the KCV antigen in 0.125 
M Ha borate buffer, pH 8.3, 0.075 M NaCt (BBS) was added to each well of a microti tar plate (Dynatech Immulon 
2 Removawefl Strips). The plate was Incubated at 4°C overnight in a humid chamber, after which, the antigen 

5 solution was removed and the wefls washed 3 times with BBS containing 0.02% Triton X-100 (BBST). To pre- 
vent non-specific binding, the weDs were coated with bovine serum albumin (BSA) by addition of 100 microliters 
of a 5 mg/ml solution of BSA tn BBS followed by incubation at room temperature for 1 hour, after this incubation 
the BSA solution was removed. The antigens In the coated wells were reacted with serum by adding 100 micro- 
liters of serum samples diluted 1:100 In 0.01 M Na phosphate buffer, pH IX, 0.1 S M NaCi (PBS) containing 10 

10 mg/ml BSA, and Incubating the serum containing wells for 1 hr at 37°C. After Incubation, the serum samples 
were removed by aspiration, and the welts were washed 5 ernes with BBST. Antibody bound to the antigen 
was determined by the binding of 125 Wabeted F'(ab)2 sheep antWwman IgG to the coated wefls. Ailquots of 
100 mtarolBers of the labeled probe (specific activity 5-20 micro curios/ rrdcrogrBm) were added to each well, 
and the plates were Incubated at 37°C for 1 hour, followed by removal of excess probe by aspiration, and 5 

is washes w&h BBST. The amount of radioactivity bound In each well was determined by counting in a counter 
which detects gamma radiation. 

Table 1 below presents the results of the tests on the serum from individuals diagnosed as having HCV. 
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INDIVIDUAL ANTIGEN 

S2 C22 C100 C33c NS5 

AVH-IVDA N P/N N P(44) N 

5 

AVH « acute viral hepatitis 
CVH 23 chronic viral hepatitis 
PTVH = post- trans fusion viral hepatitis 
IVOA = intravenous drug abuser 
crypto ■ cryptogenic hepatitis 
NOS * non-obvious source 
is P = positive 
N * negative 



Per these results, no single antigen reacted with aB sera. C22 and C33c were (he most reactive and S2 
20 reacted with some sera from some putative acute HCV cases with which no other antigen reacted. Based on 
these results, the combination of two antigens that would provide the greatest range of detection ts C22 and 
C33a rf one wished to detect a maximum of acute Infections, S2 would be included in the combination. 

Table 2 below presents the results of the testing on the paid Wood donors. 
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The results on the paid donors generally confirms the results from the sera of Infected individuals. 
Example 7: EU3A Determinations of HCV Antibodies Using Combination of HCV Antigens 

40 Plates coated with a combination of C22 and C33c antigens are prepared as follows. A solution containing 
coating buffer (50mM Ma Borate, pH 9.0), 21 mi/plate, BSA (25 mlcroojanisfrnJ). C22 and C33c (2.50 mfcrog- 
rams/ml each) is prepared Just prior to addition to the RemoveaweO Immulon I plates (Dynatach Corp.). After 
mixing for 5 minutes, QMJwdH of the solution is added to the plates, they are covered and incubated for 2 
hours at 37 °C, after which the solution is removed by aspiration. The wefts are washed once with 400 micro! iters 

45 wash buffer (1 00 mM sodium phosphate, pH 7.4. 140mM sodium chloride. 0.1% (WW) casein. 1% (W/V) Triton 
x-100, 0.01 % (WA/) Thlmerosa)). After removal of the wash solution. 200 micrditers/wetl of Postcoat solution 
(10 mM sodium phosphate, pH IX 150 mM sodium chloride, 0.1% (w/v) casein, 3% sucrose and 2 mM 
phonyimothylsulfbnyifhjofide (PMSF)) is added, the plates are loosely covered to prevent evaporation, and are 
allowed to stand at room temperature for 30 minutes. The weOs ere then aspirated to remove the solution, and 

60 lyophflfeed dry overnight without shelf heating. The prepared plates may be stored at 2-8°C in sealed aluminum 
pouches with desslcant (3 g Sorb-it™ packs). 

In order to perform the EUSA determination, 20 microliters of serum sample or control sample Is added to 
a well containing 200 microliters of sample dSuent (100 mM sodium phosphate, pH 7.4, 500 mM sodium 
chloride, 1 mM EDTA, 0.1% (W/V) Casein, 0.01% (W/V)™"«">sal, 1 % (WV) Triton X-1 00, 1 00 microgramsmil 

ss yeast extract). The plates are sealed, and are incubated at 37°C for two hours, after which the solution b 
removed by aspiration, and the wells are washed three times with 400 microliters of wash buffer (phosphate 
buffered saline (PBS) containing 0.05% Tween 20). The washed wells are treated with 200 mlcroifters of mouse 
enWrornajvlgG^iorse radish peroxidase (HRP) conjugate contained In a solution of Ortho conjugate dfluent 
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(10 mM sodium phosphate, pH 12, .150 mM sodium chloride, 50° (V/V) fetal bovine serum, 1% (V7V) heat 
treated horse serum, 1 mM KjFe(CN)* 0.05% <VW V) Tween 20, 0.02% (W/V) TWmerosal). Treatment b for 1 
hour at 37 °C, the solution b removed by aspiration, and the wefts era washed three times with 400 ml wash 
butter, which ts also removed by aspiration. To determine the amount of bound enzyme conjugate, 200 micro* 
s liters of substrate solution (10 mg Ophenylenediarrtine dihydnxfdcride per 5 ml of Developer solution) ta added 
Developer solution contains 50 mM sodium citrate adjusted to pH 5.1 with phosphoric add, and 0.6 microli- 
ters/mt of 30% HjOj. The plates containing the substrate solution are Incubated In the dark for 30 minutes at 
room temperature, the reactions are stopped by the addition of 50 microiiters/rnJ 4N sulfuric acid, and the ODs 
determined 

19 In a aimflar manner, EUSAs using fusion proteins of C22 and C33c, and C22, C33c, and S2 and combi- 
nations of C22 and C100, C22 and S2, C22 and an NS5 antigen. C22. C33c end S2. and C22, C100, and S2 
may be carried out 

Modifications of the above-described modes for carrying cut the invention that are obvious to those of skiO 
(n the fields of molecular bidogy, immunology, and related fields are intended to be within the scope of the fbi- 
f5 lowing claims. 



Claims 

A combination of synthetic hepatitis C viral (HCV) antigens comprising: 

(a) a first HCV antigen from the C domain; and 

(b) at least one addffional HCV antigen selected from the group consisting of 
(i) en HCV antigen from the NS3 domain; 
(0) an HCV antigen from the NS4 domain; 
(Hi) an HCV antigen from the S domain; and 
ftv) an HCV antigen from the NS5 domain. 

2. A combination of synthetic hepatitis C viral (HCV) antigens comprising: 
(a) a first HCV antigen consisting essentially of the C domain; and 
30 (b) a second HCV antigen from foe NS3 domain. 

X The combination of claim 2 wherein the ftst HCV antigen is C22 and the second HCV antigen Is C33c. 

4. The combination of claim 2 Including (c) a third HCV antigen from the S domain. 

35 

5. The combination of daim 3 Including (c) HCV anugen 52. 

0. A combination of synthetic HCV antigens comprising: 

(a) a first HCV antigen consisting essentially of the C domain; and 
40 (b) a second HCV antigen from the NS4 domain. 

7. The combination of daim 6 wherein the first HCV antigen is C22 and the second HCV antigen is C1 00. 

8. The combination of daim 6 Indudtng (c) a third HCV antigen from the S domain. 

45 

9. The combination of daim 7 induding (c) HCV antigen S2. 

10. The combination of daim 1, 2, 3, 4, 5, 0, 7. 8 or 9 wherein the combination is in the form of a fusion polypep- 
tide. 

so 

11. The combination of daim 1. 2, 3, 4, 5, 6, 7, 8or 9 wherein the combination is in the form of said first HCV 
antigen and said additional antigens individually bound to a common soBd matrix. 

12. The combination of ctatm 1 1 wherein the solid matrix is the surface of a microtiter plate well, a bead or a 
55 dipstick. 

13. The combination of daim 1, 2, 3, 4, 5, 6, 7. 8 or 0 wherein the combination Is In the form of a mixture of 
said first HCV antigen and said additional HCV antigen(s). 
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14. A method for detecting antibodies to hepatitis C virus (HCV) in a mammalian body component suspected 
of containing said antibodies comprising contacting said body component wfth the combination of synthetic 
HCV antigens ot dalm 1 . 2, 3,4; 5, 6, 7, 8. 9 t 10. 11. 12. or 13 under conditions that permit antfoody^ntigen 
reaction and detecting the presence of Immune complexes of said antibodies and said antigens. 

15. A method for detecting antibodies to hepatitis C vims (HCV) in a mammalian body component suspected 
of containing said antibodies comprising contacting said body component wfth a panel of synthetic HCV 
antigens comprising; 

(a) a first HCV antigen from the C domain; and 

(b) at least one additional HCV antigen selected from the group consisting of 
(I) an HCV antigen from the NS3 domain; 

(B) an HCV antigen from the NS4 domain; 
(6i) an HCV antigen from the S domain; and 

(iv) an HCV antigen from the NS5 domain under conditions that permit antibody-antigen reaction 
and detecting the presence of immune complexes of said antibodies and said antigens. 

16. A kit for carrying out an assay for detecting antibodies to hepatitis C antigen (HCV) in a mammalian body 
component suspected of containing said antibodies comprising in packaged combination: 

(a) the combination of synthetic HCV antigens of any one of claims 1-13; 

(b) standard control reagents; and 

(c) Instructions for carrying out the assay. 
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-3«1 GCC^CCCCTGATGGGGGCCA 
CGCTCGGGGCACTACCCCCGC? 

5*GAGClWTACTTACTCA5G5C»CrtCTCZriCATGACMAACTGCG 

259 CCATTCCCTTAGTATCAGIGTCC-GCAGCC^^ 

GCTACCGCAATCATACTCACAGCAarrCCG;«TCCro 

199 vlCGTCTGCGSAACCGGTGAGTACACCSSAATTGZCAGGACGACCCSGTCCT TT C TT SSA 
^CeaCACGCCTO&CCACTCATG?^^ 

139 

79 GTTC^GTCGCaAAAGGCCTTGTGGTACTCCCTGATAGCGTGCTTCCCAGTGCCCCGGGAG 
CAACCCAGCGCTTTCCGCAACACCATGACCGACTATCCCACGAACGCTCACGGGGCCCTC 

19 GTCTCGTAGACCGTGCACC 
CAGAGCATCTCCCACGTCG 

Arg Thr 

, ;^^^A»P-^y*ProGlijLysLysAfliu,ys^ 

- *^^*^CGAA?CCTAAACCTCAAAAAAAAAACAAACGTAACACCAACCGTCGCCCACAG 
TACTCGTGCTTAGCATTTCCACTTT'ITniT l GTTTGCATTGTGGTTGGCAGCGGGTGTC 

A«pVaUytP)^rcClyGlyGlyGliaieVdlClyGlyVftlTyrUuLauPrQArgArc 
61 ^^^CCCCMTCCCGGTCAGATCGTTGGTWAG 
CXCASnCAAGCGCCCACCGCCAGlCTAGCWlCaurCTCAM 

CCGG^ATCrAACCCACACCCGCGCTGCTCTTTCTGAACGCrCCC^ 

181 A^GT^CTATCCCCAAGGCICCI^ 

TCTCCAGTCGGATAGGGGTTCCCAGCAGCCGGGCTCCCCTCCTGGACCCGAGTC3GGCCC 

ATGC^AACCu3GGACATACCGTTAC?CCCGACGCCCACCCCCCCrACCCACGACACAGCG 

5$ra^*^ c ^ TCG ^cccA^ 

GCACCjAGAG^CGGATCGACCCCGGGGTGTCTGGGGGCCGaiTCCAGCGCGTTAAACCCA 

Ly6ValtieAspThrUuThrCysGlyi^«Al^pUuMetGlyTyrlleProLeuVal 
AAGCTCATCGATACCCTTACCTGCGGCTTCGCCCACCTCATGGGGTACATACCCCTCGTC 
TTCCAGTAGCTA1<K>GMTGCACGCCGAAGCGGCTGGAGTACCCCA1GTAIGGCGACCAG 

GlyAl 3 P r oLeuG I yGly Al a Al aAr a AlaLauAlaHl sSly ValArgValLeuGluAfip 
4 21 GGCGCrCCTCTiGGAGGCGCTGCCAGGGCCCTGGCGCASGGCGTCCGGGTrCTGGAAGAC 
CCGCGGGGAGAACCTCCGCGACGjTCCCGGGACCGCGIACCGCAGCCCCAAGACCTTC'IG 

Thr 

ClyVaU$r.r/rAlaThj^lyAsni#uProGlycysScrPhaSerJiePheLcaUuAla 
4 81 ^-^^CTATGCAACAGGGAACCncCTGGTICCTCTTTCTCTATCTTCOT 

w^«CAlTTGATACC?TCTCCCTTGGAAGGACCAACCAGAAAGAGAIAGAACCAAGACOCG 

-eul^erCysI*uTtavaPx^laS^AlaTy^ 
Ml -TGCTCTCTT3CTTG ACTG7CCCCGC ?TCGG rCTACCAACTC ZC CAACTCCACGGGGCTT 
GACGAGAGAACGAACrGACACGGGCCAAGCC3GATGGT?CACGCGTrGAGGTCCCCCGAA 

TyrHisYa:rhrA«nAspC7sProAsr.serScr:ieValTyrGluAlaAlaA5pAlaIle 
601 *ACCACGTCACCAATCATTGCCCTAACTCGAjXATTCTGXACGA<fGCGGCCGAlGCCATC 
ATGCTCCAGTCGTrACTAACCGGATTCACCrZATAACACATGCTCCGCCCCCTACGGIAG 

Figure 1 (Sheet 1 of 10) 
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LeuHls rarProClyCysValPrcCvsVal ArgGiuClyAsaAl aSerArgCyslrpva* 

661 ctgcacacrccggggigcgtccctrjcgttcgtgw 
sacctstgaggccccacccagggaacscaagc^^ 

AlaMerT^PrcThjrVtmaThrArgAspGlyiysLeuProUaThrClnLeuAr^Arq 

::: scgatgacccctacggiggcc^ccag^ 

CGCIAC TSGGG ATGCCACCGGTGGTCCCTACCGTTrGAGGGCCCCTGCGICSAAGC^ 

HtBlltAepUuUuVftlGlyScrAlftThrLeuCysSftrAlALeuTyrValSXyAfipLeu 
TBI ^CftTC3AIClCCTX6TCCCCACCGCCACCC?CrCTTCa^ 
GI«I*CCIAG;£GAACAGCCCTCCCG3TGGGAGA 

Cy«GlyS«rV*lPhelAuValGlyCloiAuPheThrPheSerProAr9ArgHlsttpt>2 
&41 ICCGGGTC lGTC r i r C tt C I CC CCCAACTGTXAOCTTCTCTCCCAGGCGCCACTSGACG 
ACCCCCAGACAXSAAAGAACA^CCGGTTCACAAGTGGAAGAGA^ 

thrGir£lyCysAsnCyiS«rlUryrP^ 
901 ACGCAAGGTTGCAATTCCTCTA rCTATCCCGGCCATAT AACGGGTC ACCGCATGGCmTGG 
TGCGTirCAACGTTAACGAGATAGATAGGCCCGGTATATTGCCCAGTGGCGTACCGTACC 

V*l 

Aj^tyjetHetJUnl^SftrPretfhiThjrAlft^ 
961 SATATCATGA^CAACTCGTCCCCIACGACGGCGTTGGTAATGGCTCAGCTGCTCCG^TC 
CTATACTACrACITGACCRCGGGATGCTGCCGCAACCATTACCGAGTCGACGAGGCCTAC 

PrcClnAlallei*UA«pMetIleAXaGlyAlaMi6trpGIyV«lLeuAlaClyIlcAla 
1021 tCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGCCATACCC 
GGTGrrCGGXAGAACCTGTA^AGCGACCACGAGTGACCCCTCAGGACCGCCCCTATCGC 

T7rPheSertetValClyABnTrpAiaLysValUuValVaU«ia.euLeuPncAlftGly 
1061 TATTTCTOCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGtGCTGCTGCTATTTGCCGGC 
AIAAAGAGGXACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCG 

YdlAspAlaGluThrHisvallhrGiyGlySerAlaGlyHisThxValSerGlyPhcVal 
114 1 GTCQ\CGCGGnAACCCACGTCACCGGGGGAAGTGCGSGCCACACTGTGTCTGGATTTGTT 
CAGCTGCGC~TTTGCGTGCAGTGGCCCCCTTCACGGCCGGTCXGACACAGA.CCTAAACAA 

ScrLeuUuMaProGlyAlaLyfiGlnAfinValGlnl^ulleA*nthrASDGlySerTrp 
1201 AGCCTCC?CCCACCAGCCGCCMGCA5AAC(TrCCACCTGArC^ 

TCCGACGAGCGTGGTCCGCGGTTCGTCTTSCAGGTCGACTAGTTGTGGrTGCCGTCAACC 

HisI*uten$erThrAial*uAsnCysAsnAjpSerLei^ 
1261 CACCTCAATAGCACGGCCCTGAACltKJATGATAGCCTCAACACC^ 

GTGGaCTTATCGTGCCGGGACTTGACGTTACTATCGGAGTTGTGGCCGACCAACCGTCCC 

LeuPheTyrHtsHisLyBPhaAsnScrSerGlyCysProGluArgDeuAlaScrCysArg 
1321 CTTTXIATCACCACAA^CAArTCTTCAGGClGTC^ 

GAAAAGATAGIGGTGTTCAAGTrcy«AAGTCCGACAGGACIC^ 

Prt^uThrAspPheAspGXnGlyTrpGlyPrDlleSerT/rAlaAsnGlySerGlyPro 
12B1 CCCCTTACCCAITTTGACCAGCCCrGCGGCCCTATCAGTTATGCCAACGC AAG CGGCCCC 
GGGCAATGGCTAAAACTGGTCCCGACCCCGG<^^ 

AjpGlriAxgProTyrCyaTrpHisryrProPrcLysProcysGlyneValProAlaLys 

1441 GACCAGCGCCCCTACTGOCGCAaACCCC 

CTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAACGCCATAACACGGGCGCTTC 

SwvaiCysGiyProvalTyiCysFh^ThrProSerProVelValVaaciyThrTtoABp 
AGIGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTGGTGGTGGG^ 
TCACAC ACAGCAGGCCA2ATAACGAAG rGAGGGTCCGCGCACCACCACCCWGCTGGCIG 

AxgSerGlyAlaProThrtyrSerTrpGiyGl^ 
AGGICC-GGCGCGCCCACCTACACCTGGGGTGAAMTGATACGGACGTCTTC 
TCCAGCCCGCGCGGGTGGATGXCGACCCCAClTrTACTATCCCTGCA^ 

Asnr^A^gPrcProUuGlyAsnTrpPheGlyCysThrTrpwetAsnSerThrGlyPhe 
Figure 1 (Sheet 2 of 70) 
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- 1 2. AATACCACCCCACCCCIGGGCAATTC^TTCGGTTGTACCT^ATCAACTCAACTGGirrC 
TTATKTCCSGTCCCGAeCCGTT^CAAGCCAACAT^^ 

TtelymiCy*GlyAlaProprccys val I leGlyGlyAl aGlyAsnAinThrLeuHls 
1681 ;^CAAAG7GTGCGGAg(^C?CCTTG7GTCATCGGAGGG0CGGGCAACAACACCC?CCAC 
^GTTTaCACGCCTOX^^^CA' 

Cy«ProTnrAcpCy«Ph«Ar7LyaHisPrcAtpAiaThrTyrSatAr9CyaCXyS«rCly 
1741 TGOp^ClGATIGCITC^^ 

ACCGGCTGACTMCGAAGGCGTTCGTAGGCCTGCGCTGTATGAGAGCCACGCCGAG^C^ 



Xla 



1301 



:esi 



?rorrpUu7^Prcs^gCysLeuVAlAspryrProTyrArcUuTrpHi$TyrProcy9 
cccicraicACAcccascrGCcrtsttc^ 

SGGACCIAGTGTGGGTCCACGGACCAGCtGATGGGCATATCCGAAACCGra 

ThrlleABnTyrThrllePheLysIlQAr^tTyrvalCXyClyValClaHlfeArgUu 
ACCAtC^ttACttttTATTXANJira 
raiAGTTtUlCTCGTATAAATrTIAGra^ 

1921 CA^raCTG^CTO 

CTTCCWCGGACGTTOCClCCGCCCrc^ 

CluLeuSerProl^uL*uI>urhrThm^ioTrpGlnYalLftuProCy^ftrPhoT^ 
:981 GWHrrCAGCCCGmOGCTGACCACIAe^^ 

cTCGAGTCGGGCAATtoCGACTGGTGATGTCTCAca^ 

rhrUuProAlalAuSemrtlyiAuiieHlsLcuHisGloA»nil8V«iAflpvalGln 
2041 A£CCIACCAGCCTTGTCCACCGGCC:X*TC^^ 

?GCGATGGTCGGAACA£GTGGCCGGACTAGGTGGA£GTGGTCTTGTA>CACC^ 

ryrLeuTyTGlyValGlySerSerllcAl^mrTrpAlBllcLyaTrpGluTyrValVal 
2101 ?ACTTGTACGGGGTGGCGTCAAGCATCGCGTCCT»^CCATTAAGTGGGAGTACCTCCTT 
ATGW^ATGCCCCACCCCWTTCGTAGCGCAjTCACCCGGlAATTCACCCTCATGCAGCAA 

i^ui^uPhpr, ft uifui^uAlaAjpAlaArgVftlCysSexCyiaLauTrpy^ 
2161 CTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTT6TGGATGATGCTACTC 
CAGGACAACCAACACGAACCTCTCC£CGCGCAGACGAGGACGAAC^ 

IleSerGlnAlaGl uAlaAlaLeuGluAsrXeuVal IleLeuAanAlaAl aSerLauAl a 
2221 ATAICCCAAGCGGA^GCGGCTTTGGAGAACCTCCTAATACTIAATGCAGGITCCCTGGCC 
TATAGGGXTCGCCTCCGCCGAAACCTCITGGAGCATtATGAATTACGTCGTAGGGACCGG 

GlythpHlBGlyVouValSerPhcLeuValPhePhxCysPJieAlaTrpTyrLeuLysGly 

::b: cujgacgcacggtcttgtatccttcctcctgttcttctgcttigcatggtatttgaagggt 
ccctccgtgccw^cataggaaggagcacaw^^ 

2341 AAGTGGGIGCCCGGAGCGGTCTACACC7TCTACGGGATGTGGCCTCTCCTCCTGCTCCTG 
nCACCCACCGGCCTCGCCAX^IGlt^AAGATCCCCTAaCCCCA^ 

LeuAlatAuPrrflnArgjaaTyrAAai^ui!^ 
2401 TTCGCCTTGCCCCAGCCGGCGTACGCCCTGGACACCGAGGTGGCCGCGTCGTGTGGCGCT 
AACCGCAACCCCGTCCCCCGCAIGCGCGACCTGTGCCTCCACCGCCGCAGCACAra 

ValValLeuValGlyLcuMetAlaUuThrLeuScrProTyrT7rLysAr?TyrIlcSer 
2461 5TTGTTCTCGTCGCGTtCATGGCGC7oACTCT3TCACCATATTACAAGCGCTATATCAGC 
CAACAAGWSCAGCCCAACTACCGC^ 

<A§n> 

TrpCysLeuTrprrpLeuGlnTyrPhftLauThrArgValGluAlaGinLeuHlaVdltrp 
2521 TGGTGCTTGTGGTCGCTTCAGTATTrTCtGACCAGAGTGGAAGCGCAACTCCACGTGTCC 
ACCACGAACACCACCGAAGTCAIAAAAGACTGGTCTCACCIICGCGTTGACCt'Jw % ACC 

IltProPrdLeuAsnValArgSlyGlyArgAapAla^^ 
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..:S6i A!^CCCCCCCCCAACCTCCGACGGGGCCGCGA^^ 

?AACCCCCCKACTTCC»CCCICCCCtCSCCCTGC&3CW^ 

KlsPrt3T^liWVtlPhaAipil«ThrLy«UuLeul^uUaV&iPh^lyProLcuTr? " 
:S4i CACCCCACtCTGgT ATTTG ACATCACCRAAriGCTGCIGG uCu r CTTCGGACCCCTTTGG 
CIGGSCTGAGACCATAAACTGTACTGGTrTAACGACGJ^ 

*l^uGlrJVlas«ri^ul^uLy5V4lProT7rPheValArgValClnGlyLeuLeuArg 
"CI ATTCTlCAAGCCAGTmcriAAAGTACCCTAC^^ 
^AAGTT(^TCAAACGAAI??CAI(tfG^^^ 

Z.61 TTC^CCGTTAGCeCGSAAGATCATCC^^ 
AA^OSCGC^TaCGCCTTCTACIACC^ 

,™ Jf^y*l^«*IhrGlyTI«yr/aiTyr AioHisLeuThr?rtSL«uAr^ AspTrpAla 
<B2I TTAGGGGCGCTTACTC3X!ACCIATGTT?ATAACCArc 
AATCCCCCCCAAIGACCCICCATACAAATATTG^ 

HtcA^lyl^uArgAa^uAlavalAlavalGluProValValPheSerGXnMatClu 

CXOTIECC»ACGCTCTAGACCGGC1U:CGA^^ 

ThrLyaLeiillellirTrpGlyMaAspThrAlaMaCyjClyAapIlelleAaiiClyteu 
2941 ACCAWXTCATCACGTG<^(^CAGATACCCCCGCGTC 

ItWna^AOTCAJCCCCCCCTCTATCCCSCOXAC^ 

p ^ v a3^ ^^Ar9GlyArgGlunel^ui^uGlYProAlaAa^l7tletValS^ 

3001 CCTCTTTCa^CCCWKaCCCGGGA^ 

GGAC^AGCCCGGCGrCCCCOGCCCTCTATCACGAGCCK^ 



3061 



LyaGlyTrpArgUulJauAlaProllei^AlaTyrAlaGljiGlnThrArgClyUuUu 
AAGGGGTGGAiili I11jL1\jCCGCC^TCACGGCGTACGCCC^GCACACAAGCGGCCTCCTA 



GiyCyallcIleinrSerLeuThrclyArgAspLyaAaaGlnValGluGlyGluVUGln 
3121 <X5GWtAtAATCACCAGCCTAACTGGCCGGGACAAAAACCAACTGGA3GGTGAGGTCCAG 
CCCACGTATTW^C5CGGATlCAC(^CCTCTTrTTCOTCACXTCC(^ 

I lc v *^SerThrAlaMaGlnThrPhalsuAlaThrCyaIleAinGlyValCyaTrpThr 
3181 AITGTG K^CTGCTOCCCAAACCTTCCIGGCAACGTGCATCAATGGGGTGTGCTGGACT 
TAACACAGtltMGAOKGITTGGAAGGACCGTIGCACGTAGTTACCCCACACGACCTGA 

vaiTymsGlyAlaClyThrArgThrlltAlaSarProLysGlyPrcValllcGlnKet 
2241 GTCTACCACGGGGCCGGAACGAG5ACCATCGCGXCACCCAAGGGTCCTGTCXTCCA6ATG 
CAGAtGCTCCCCCGCCCTTGCTCCTGGTAGCGCAGTGGGTTCCCACWACAGTAGGTCXAC 

Sar Thr 

TyrthrAsnValAaDGlnAapUuValClyTrpProAlaProGlijGlySerArgSftrUu 
3301 TATACCAAWTAGACCAAGACCTTGIG(»CrGGCCCCCTCCGCAJWU:iW 

ATAT&<TTACATCTGCTTCKK»AACACCCGACCGGGCGAGGOGTXCCAXCGGCGAGTAAC 

ThrPrDCysThrCyaGlySarSerAapLeuTyrLauVilThrAr^HltAlaAcpValllc 
AC^^TGCACTT^^ 

PreVilArgArgArgGlyAtpSftrArgGlyScr^uUi^erProArgProXlcSerTyr 
j421 -CCCT^CGa^^ItaiACOK*^^ 

GGGCAXCGGCCXCCCACTATCCTCCCCGTCG5ACGACAGCGGGGCCGGGTAAAGGATG 

UulysGlySerScrGlyGlyProLcuLeuCysProAlaGlyHlsAlaValGlyllePJie 
3481 ITGAAACCCTCCTCGCCCCC?CCGCTGTTGIGCCCCGCGGGCCACCCCGTG(MKMAm 
MCTTTCCCAGCACCCCCCCaCGCGACAACACGttra 

Ar^AlflAlaVftlCyaThrArsGir/alAlaLysAlaVaUacPhallfcProvalGluAJn 
3541 AGCTCCCCGGTCTGCACCCGIGGAGIGGC^ 

Figure i (Sheet 4 of 10) 
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3601 



TCttGSCGCC^CGTCGCCACCTCACCGATTC^ 

LcufiluXhrThjMetArgSerProValPhelhrAspAstiSerSarPrcPrcValValPro 
C?AGAGACAACttTCAO;iCCCC3GT<?7:CAC^ 

6lDSerPheGlnValAl«HlsLeuHl*AlaFroThrGlySftrGlyLysSerThrLy«Val 

3661 GTC^'AAGGTXATO 

ProAl&AlaT7rAlaAlAClT.2l7TyrLysVAlL«uVali^uAfiiiP7QS«x\'alAlaAla 
3^31 CCGGCrGCATAT6CAfiCTCAGG&C7ATAACCT^C7AG7ACTCAACCCCtC?G7TGCTGCA 
CCCCGACGTAXACGXCGACTCCCSATAJTCCACGATCAI^ 

Leu 

lhrl«uGl7?heClyAIaTyxxetSerLyspaaHi5Glyiiei^pProA£nXleAr?7hr 
3781 ACACTjGGCTntWTTCTTACATl^CAAGCCTCAIGuGMCGATCCIAACAtCAfiSACC 
TClCACCCC^.V^CCACttAATGTACACGTTCCGAS7ACCC7AGCTAG<^TTGTACTCCTGG 

Gl^alAjrqThrllcThxThrGlySerProUeThrTyrSerThrTyrGlvLysPhaLeu 
3841 GG^SACAACAATTACCaCTGGCACCCCCaT^ 

^CCCACTCITCTTAATG(IrGACC3TCGGGGTWTGCATGAGGTCGATGCCGtTCAA^ 

_ _ "^PGXyGXyCycSeiGlyGlyAlaTyrAspIlellftlltCyfiAfipGluCyflHisSer 
3901 GCCGACGGCGGGTGCTCGGCGGGCGCTTATGAt^TAATAATTTGTGACGAGTGCCACTCC 
CGGCT5CCCCCCACGAGCCCCCC3CGAATACTGTATTATTAAACACTGCTCACGGTGAGG 

<val> 

thrAspAiaThrSerllel^ClyileGiyThrvallAuABpGlftAlaGlunjrAlftGly 
3961 ACGGATGCCACaTCCATCTIGGGCAICGGCACTGTCCTTGACCAAGCAGAGA 
TXKrnCCGTCTAGGTAGAACCCStATCCGTGACA^ 

. AlaArgLeuValValLeuAlaThrAlaThrProProClyServ*iThrValProHiflPro 
4021 GCG^ACTGGTTGTGCTCGCCACCCCCACCCCTZCGGGCTCCGTCACTGTGCCCCATCCC 
CGCICTGACCAACACGAGCGGTG3CGGTGCGGAGGCCCGAGCCAG7GACACGGGGTAGGG 

A « CI ^l^« v *lAlaI^uScrThrThrGXyGluIl^roPheTyxGlytya^ 
4081 AACATCGACGAGGTTGCTCTCTC ^ACCACCGG AGAG ATCCCTTTTTACGGCAAGGCTATC 
TTGTAGCTCCTCCAACGACACA(%TCGTGGCCTCTC?AGGGAAAAATCCCCTTCCGATAG 

« 14 1 CCCCTCSAAGT>ATCAAGCGGGGGAGACAICTCATCITCTGTCATTCAAAGA^ 
GGGGAGCnCAnAGntCCCCCCTCTOIAGACTAG^ 

AspGluUuAlaAlaLy^uValAlaX«uClyilcAflnAlaValMATyrT7rw 
GACGAACTCGCCGCAAAi^TGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGT 
CTCCITGAG«GCCTTTCGACCA£CCTAACCCGTto 

T^uAapValSei^alIlttProThrSeTGlyA«pvuValValVftlAlaThrA*pAlaLau 
CTTGACGTGTCCCtCATCCCGACCAGCG(SCGATGr7CTCGTC3TCGCAAOCGATCCCCTC 
G»ACTGCAGAGGCAGTAGGCCTGGTCGCCGCTACAACAOCAGC^ 

Tyr 

KetT^ClyTvrThrGlyAspPh^spsemille^ 

ATGAC CGGC?ATACCGGC<^CTTCG ACTCGGICAXAG ACT^^TACGTGTGTCACCCAG 
4ACTGGCCGATATGGCCGCTGAAGCI3AGCCACTATCTGACGTTATGCACACAGTGGGTC 

(Ser) 

?>jValAspPheSerljBuAspPrcThrPheThrXieGluThrIleThrLeuProGlnA<p 
ACAGTCGATTTCAGCCTTGACCCTACC7TCACCATTGAGACAATCACGCTCCCC 
TGTCACCrAAAGTCGGAACTCGGArGCAAGZGGTAACTCTGTTAGTGCGAGGGGGTCCTA 

AlaValSerAZfThrGlAArgArgGl7AxgThrGlyAr^Giyiy3PrgGlyIl«TyrArg 
GCTGICTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGCAAGCCACGCATCJ : .V AGA 
CGACAGAGGGCGTGAGTTCCAGCCCCGTCCTGACCGTCCCCCTTCGGICCGTAGA jCXCT 
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?heVftlAl*ProClyGluArgPrDS«CIvHetPh«A*|)5«rS«rValL«uCy*GluC7at 
1 50 1 -7ICTGGC^CCGGGG6AGCGCCC2TCCG«CAT6TTCGAC1CGTCCCTCCTCTGTGAGTGC 
AAACACCGI^CCCCTC£CG3SAGtt^ 

TyrAsDXlaGlyCysAlttTrpTyrGlulcu-hrProAlaClttTmhrvaLvgUuArg 
4561 TATGA!3K»GGC7G7GCTTOIAICAGCTCA^^ 
AtWrKCGTCCGACACGAACCATACTCCW 

AlaXyrMetAsoThrProGlyl^uPrcValCy8GliiAspHisL«uCiuPhfiTrpGluGly 
4621 ^CGTACATGrj^CCCCGG^CrrCCCGTGIGCCA^ACXATCTTGAAlTlUGGGAi^GC 
CCCAtSIAnTST GG C re CCC G AAgSGCACAra^ 

ValPh8ThrClyUuTh4rKisIlftA5pAUHisPh«l^uSuGlD7hrL7«3InSerCX7 
4681 G TCTT TACACCCCTCACTCATAIAGATCCCCACTTTCTATCCCAGACAAAGCAG^ 
CAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTT^^ 

GluAanUuPrcTyrLauYiUliTyr^lriJaaThrValCyaAlaArgAlaGlnAlaPro 
4741 GAGMCCTTCCTTACCTGGTAGCGTACCAAGCCACCGrGTGCGCTAGGGCTCAAGCCCCT 
C TCT 1 3GAAGG AATGG ACCATCGCAIGG IT CGGTGGCACACGCGATCCCGAGTTCGGGGA 

ProProS«xTrp^pGlniletTtpLy3Cy9UuIlftArgLftuLyflPrcThrLeuHi3Gl7 
4 801 CCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTOJCCTCAAGCCCACCCTCCATCCG 
GGGGGZAGCACCCTGCTCTACACCtTCACAAACTAACCCGAGTTCGGGTGGGAGGTACCC 

FroThrProLcuLcuTyrArgLcuGlyAiaVftlGltiAsnGluIleThrLeiiThrKisPro 
4861 CCAACACCCCTGC7ATACAGAC7SGGCGCTG7TCAGAATGAAATCACCCTGA0GCACCCA 
CGTTGTGGGGACGATATXJTCTGACCCGCGACAAGTCTTACITTAGTGGG 

ValThrLysTyrricMetThrCytKetSftrAlaAspLa^luVftlvalihPSerinirTrp 
4921 GTCAC 1AAATACATCA7CACATGCATGTCGGCCGACCTGG AGGTCG TCACGAGCACCrGG 
CAGTGvTTTATGtAGTACTGTACGTACACCCGGCtGGACCtCCAGCiATCCTCGtGGA^ 

VaUeuValGlyGly VilLauAl iAI aLeuAl aAla Tyrcy aLauSarThrG ly Cy aVal 
4981 GTGC1CGTIGGCGGCGTCCTWCIGCXITGGCCGCGTATTGCCTCTCAACAGGCTGCGTG 
CACGA3CAACCGCCCC*GGACCGACGAAACCGGCGCAIAACGGACACT?GTCCGACGCAC 

ValllaVaiGlyArgVAlVall^ciGlyLysProAlailellePrcsAapArgGluVal 
3041 GTCATAGTGGGCACGGTCGTCTTGTCCC^AAGCCGGCAATCATACCrGACAGCGAAGTC 
. CACTATCACCCGTCCCAGCAGAACA^CCTO 

LcuTyrArqCluPheMpGluKetGluGluCysSerGlnHisUuPror7rIleGluGln 
5101 CTCTAZCGAGAGTTCCATGAGATCGAAGAGTGCTCTCAGCACrtACCGTACATCGACCAA 
GAGATCCCTCtCAAGCTACTCIACCllCTCACGAGACTCGTGAATGGCATGTAGCTCGTT 

GlyMatMati^uAlaGluGlAPheLysGliiLysAlalAuGl>l«uLeuGlnT^AlaSer 
5161 GGGAT>3ATGCTO?CC6AGCAGTTCAAGCAGAACCCCCTCGGCC7CCT6CACACCGCCTCC 
CCCIA:iACGACCCCCICCICAACnCGTClTCra 

ArgGinAlaGluVaillaAlaProAlaValGlnThrAsiiTrpGlnLyiLeuGluThrPhe 
5221 CCTCAGGCAGAGGTXATCGCCCCTGCTGTCCAGACG^CTGGCAAAAATTCGAGACCTTC 
GCAGTCCGTCrCCAATAGCGGGGACGACAGGTCTGGtTGACCGTTTTTGAGCtCTGGAAG 

TrpAlaLysHisMetTrpAanPhe J laSerGly XleGlnTyrLeuAlaClyLouSerThr 
528 1 TGGGCGAAGCATA7GTG5AACTTCA7CAGTGGGATACAATACTTGGCGGGCXXGTCAACG 
ACCCGCTTCGTATACACCTTGAAiTACTCACCCTATGTTATGAACCGCCCGAACAGTTCC 

LeuproClyAfinProMalieAlaSerUu^atMaPheXhrAlaAlaValThrSarPro 
5341 nGCCTGGTAACCCCGCCATTGCTrCATTGATCGCTTTTACAGCTGCIGlCACGlGCCW 
GACGGACCATTGGGGCGGTAACGAAGIAACIACCGAAAATGrCGACGACACTGGTCGGGI 

LeuT.^rT^rSerGl nThrLftuLeu PheAan I leLeuG lyClyTrpVolAlaAl iGlnUu 
5401 CTAACCACTAGCCAAACCC7CCTCTTCAACATATTGGGGG5GTGGGTGGCIGCCCAGCTC 
GATTGGTGATCGSTTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACC3ACGGGTCGAG 

AlaAlaProGlyAlaAlaThraaPhcValGIyAlaGlyLauUaGlyAlaAlallaCly 
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CGGCGCra^ttCGSCGATGACGC^CACCCGC^^ 

SeiYalClyl^\K3lyLyfiVaU^ull«AfipIleLeuiaaGlyTyT<31yAifii31yVaUta 
5521 ttTWTOJAraraGAAGffrCCTCA^ 
TCACAACCIGACC^CTTCCAWAGTATra 



ClyAlal^uVamaPheLyslleMetSerGlYCluValProSerthrGluAgpLeuVal 
5581 GGAGCTCTICTGGCAITCAAGAI^TCAG-G^^ 
CCTCGAOttCACCBIAMSTTCTJtf^ 

XJiOAui^uProAlaiifti^erProcIyAlaLcuvaiValGlyvaivalCyaAlaAla 
564 1 ^ATCXiVCTGCCCGCCATCCTCTCCCCCGGAG ZwTCGTAGTGGGCGTGGTCtGTGCAGCA 

IleLtfuArg/arqHiaValGlyProClyGluClyAXaVal^lntr^HetAsnAr^LauIle 
370; AIACXCCGCCGGttCGnWXCGGCK^GGGGCAW^ 
TATCACGCCCKCGTGCAACCCGCCCCSC^^ 

AlaPheAlaS«rApg<ayAsrJiisVeUerProThrHisT7rValProGluSerA«pAla 
5761 CCCraCCCTCCttGOGAACCAWTTTCCCC 
CGCAAGCGOUa^CCCCTIGGTACAJUW^ 



AlaAlaArgYalThxAlaIlel*u5«x5«LeuThiYalT^ 
5B21 GCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTC^ 
ay£O^CA<nCACGGIAlCA£Ttt 

HiBClaTrpXlcSerScrGluCysThrThrPit^sSArflySerTrpLeuArgAaplle 
5881 • CACCAGTGGATAAGCTOGGAGTGTACCACTC CATGCTCCGGTTCCTGGCTAAGGGACATC 
GTGCTCACCTATrCCAGCCTCACATGCTt^(OTACGAGGCCAAGCACCGATTCCCTCTW 

Trj^pTrpIl^fiCluValLeuSerABpPhcLyrrhrTrpL-uLysAlaLysLeuMftt 
594 1 TGGGAClGGATATtjCCACGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATG 
AGCCTSACCTATACCCTCCACAACTCGCTGAAATTCTt^ 

rroGlnUuProCiyIl«ProPheVaLSerCysCl!U^gx;lyTyTLyflGlyValTrpArg 

6001 CCaCAGCTGCttGGGATCCCCITTC^^ 

GGTCTCGACGGACCCTAGCGGAAACACAGGACGGTCGCGCCCAtAtTCCCCCAGACCCCT 



GlyAspGlvIleMfttHirthrAr^CyeHlsCysGlyAlaGlcIleThzGlyHlftVallys 
6061 GTCGACCG^TCATGCACACTCGCltlCCACTGTGGAGCTCAGATCACTGGACATCTCAAA 
CACCT3CC3TAGTACGTGTCACCGACGGTGACACCTCGACTCTAG7GACCTG 

AanGiyThjrMetArgUaValGlyPrt^gTra 
6121 ;j£GCCACGATCAGCATC:nOTGTCCIAG(^C?Cra 

TTCCCCTCCIACTCCIAGCAGCCAGGATCCTGGAC G I tX TT G T A CACCTCACCCTGGAAG 

PtolleAsnAlaTyrrWhrtlyPrt^sTturProLeuPrcAlaPr^^ 
6181 CXCAITAATG<^CACCACCGGCCCC?G 

GCCTAATTACGCATG TOJTGCCCC GGCACATGGGGGGAAGGACGCC GCTTGATG TGCAAG 

Alal^uTrcArqValS«rAUCluGluTyrvalGluiieArgGlnValGlyABpPhcHi« 
6241 GGGGTATCCAGGGTGTCTGGAGAGGAATATGTGGAGATAAGGCAGCTGGGGGACTrCCAC 
CGCGATACCTCCCACAGACStCTC^^ 

r7rvalT^lyMctT^hrAspAsnLeuLysCy5Prcry5GlnVal?rcSerPrc5Clu 
6301 rACCTCACGGGTATGACTACT*ACAATCTCAAAIGCCCGlGCCAGGTCCCATCGCCCGAA 
AtCCACtGCCCATACTGATG ACTGTTAGAG t TTACGGG CACGGTCCAGGG IAGCGGGCTT 

PhtPhoThrGliiLeuAflpClyValArgLeuKisArqPheAlaProPrcCysLyBProUu 
6361 7TTTTCACAGAATTCCACCGGG rGCGCCrACATAGGTTTGCGCCCCCCAGCAACCCmO 
AAAAAGTGTCTTAACCTCCCCCACGCGGATGlATCCAAACGOGGGCmACGTTCGCpGAAC 

l^uArgGluGluValSwrPhaArg^aiClyLtuHlaGluTyrProvalGlyScxClnUu 



c«y> 



(Hlscya) 
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6421 CTGC«J»GGAGGTA^ 

<*CCCC:3CCICCAT1«TAAG7C7CATCCT^ 



6481 




CCTTCCGAGCCCSAACCG^^ 
GGAACCCTOCGCCrrCCCCTSCACCGGCAC^CTCC^ 

6541 A^CAGO^ 

TJ^TCGTOTaSCCGGCCCGCTC^ 

SftfProAapAlaGluJ^IlaGluAlaAEDLeuLeu^ 

Atf^ACIACGACICGWJUTCT^ 

XleThrArgValGluSextfluAsnLv^ 

AliCluGluAspaiuArgCluIlBS«xValProAlftClull^uAx9LysSer^9^ 

CCCCTCCTCCTGCTCCCCCTCTACAGGCATC3CCGTCTTTW»^a 

PhaAlaClnAlai^uProvallrpAlaArijPro^^ 
6841 •■ nCSCCCAG^CTGCCCGTTTG^ 

JWK^TCCGGGACGGGCAAACCCGCGCCGGCC^ 

IxpLviLYaProA^pTyrCluProProValValHisGlyCysProteuProProProLys 

6901 JGGAAAAAttCCGACTAC^ 

ACCTrmCCCCCTCATCCTTCCTGCACACCACCW 

SerPrt>Prt>valProPrt>ProJ^!-yaLysAr^ 
6961 TCCCCTCC^TGCCTCCGCCTCOauU^ 

AGGGGAWACACCGAGGCGGAGCCTTCraCCCTC^ 

(Sex) 

SerthrAlal*uMaGluJU»*iaThr^ 
TOn TCWTGCCn^CGAGCTCGCCACCA^ 

AGATGACGGAACCGGCTCSAGCOnGGTCrra 

inrGlyAspABnlhrThr^SarSerGlurrom^ 

70B1 ICOQStokCMtttt^^ 

TGCCCCCTCmiCCTCTTGTA(^A<^CICCSCCGGGGMGACCGACCGGs^CTGAGG 

( PhftAla ) 

ft fr ftijtCixiq^rryrSarS£XMgtPrG?TOl^uGluGlyGluProGlyAgpPr^p Le" 
7141 C^CTGAGTCCTArrCCTCCATGCCCCCCCTW 

CTCCC^CTCAGGAiAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCT^ 

S«rAapGlyScr?rpSerThrVal5cxSer<5luMa^nAlaGluAapValValCy^8 
7201 AGCGA^GTCATGGTC^CGGTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTCOT^ 

SexttetSexTyrSerTrpThrGlyAlaUuVal^ 
7261 T^TCTCT^^a^GW 

agttacagaatgagaacctctccgcgtgagcagtggggcacgcggcgccttcttgtct^ 




7321 

ThrSerArgSerAlaCysGlrArgC^^ 
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"231 ACCTCACGCAGTwCTTSCCAAAGGCAGAAGAAAGTCAC^^ 

XGAGTGCGTCi^AAacrrrCC3TCrTCTTrCWwIAAACTO 

AspSer HlstyiGinAspValLeuLrsGl-VallrsAl aAl aAlaSeriysValLysAla 
*44l ^ACAGCCArTACCAGGACGTACtCAAGGAGGrrAAACCAGC^ 

<Ph*J 

Aani«tLmiS«xVaIGiuGluAlaCysSerLBuThrProProatsSerAlaLysserX*ys 
'501 AACXTCCTAlCCGTAGAGGKAGCTTGC^^CrGACGCCCCC^C^CTCAGCCAAATCCAAG 
TSGAACGAIAGGCftTCI C CrrCCAACGTCGGftCXCC^ 

PheGiyTyrGiyiaaXysAspVaXArgCysHlsMaArgtyiMmimRlsIIeAsn 
'561 r^GTTArOM<X^AJJ«^TCCG!^ 

AAACCAATACCCCGTT7?CTGCAGGCAACGG7ACGGTCTTTCCGG<^ 

ScrV&lTrpLysAspUuLeuGUJUpMnValThrProIleAspThjrrhrlleMctAla 
^621 rCCCTCTGGAAAGACC: rCIC GAAGACAATGTAACACCAATAGACACTACCATCATCGCT 
AGGCACACCTTrCTGGAAGACCTTCTGTX&OTTCTGGTTATCTGTGrtTGC7AGTACCGA 

LyaAanGluValPheCysValGloProGluLysGlyGlyAr^LyaProAlaArgLaullc 
76B1 AAGAACGAGGTTTTCTGCCTTCAGCCTGAGAAGGGGGCTCCTAAGCOGCT 

nCTTGCTCCAAMGACGCAAGTCGGACTCTtCCCCCCAGCATTCGGTCGAGCAGAGTAG 

VaiPheProtepl^uGlyValArgValC^sGluLyKMetAlaLeuTyrAspralValThr 
~U GTGTTCCCCGATCTCWGCGTGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACA 
CACAAGGGGCTAG ACCCGCACGCGC ACACG CTTTTCTACCCAAACATGCTGCACCAATGT 

LyaLeuProLeuAlaValMctGlyserserr/zGlyPheGlnTyrStrProGlyGlnArg 
7601 AAGCTCCCCTTGGCCGTGATGGCAAGCTCCTAOGGATTCCAATACTCACCAGGACAGCGC 
TTCGAGGGGAACCGGGACIACCCTTCGAGGATGCCTAAGGTTAlGAGTGGTCCTGtCGCC 

ValGluPhcLeuValGlnAlaTrpLysSerLysLysThrProKatGIyPhcSerTyrAsp 
7861 GTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGT^ 

CAACT^AAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTA 

TXrArgCysPheABpSerTfcrVairhiGluSer^IleJtf^^ 
7921 ACCCGCTGCTTTGACXCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAAtCSAC 
TGGGCGACCAAACrGAGGTGTCAG7GAC7CTCGC7GTAGGCATGCCXCCTCCST?A$ATG 



T981 



GlnCysC7SA«pUuAspProGliiAlaArgVilAlaIleLY5SerLeaThiCluArgLeu 

CMTGTrcTCACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTT 
GTTACAACACTGGAGCTGGGvGTICGGGCGC\CCGGTAGTICAGGGAGwOCTCTCC$AA 



(Gly) 

TyrValGlyGlyProLeuThrAfinScxAr^lyGluAfioCyc^lyTyrArgAr^cysArg 
$041 TATGT2CGGGGCCCTCITACCAATTCAAGGGGGGAGAACTGCGCCTATCGCAGGTGCCGC 
ATACAACCCCCGGG AGAA2TOTTAAGTTCCCCCCTCTTGACGCCG ATAGCGTCCACGGCG 

AlaSerGlyValUuThrThrSerCysGlyAioThrl^uThrCysl^IleLysAlaArg 
S101 ^^AGCGGCGTACTGftCAACTAGCTGTGCTAACAGCCTCACTTGCTACAICAAGGCCCK 
CGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACCATGTAGTTCCGGGCC 



8161 



AlaJaaCysArgAlaAlaGLyLcuGlnAspc^sThry.ftt±cuValCy^lyA«pAspLeu 

CCTCS^CACClCGGCGTCCCSAGGra 



ValVai:ieCy£GluSerAlaGl*/ValGlnGluA£pAlaAlaSext«uArqAlBPheThr 
92*; GTCGT7ATCTGTGAAAGCGCGGGGG7CCAGGAGGACGCGGCGAGCCTGAGAGCCrrCACG 
CACCAATACACACrTTCCCCCCCCCAGGTCCTCCTCCGCCG^ 

GluAlaMetThrArgryrSarAlaProPrcKJlyAspProProGlnProGluTyrAspUu 
82e I GAGGCrAlXiACCAGGTACTCCGCCCCCCCTGGGGACCCCCCACAACCAGAATACGACTTG 
CTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTCQjGGGTGT7GG7C77AIGCTGAAC 

GluUuIleThrSerCy$SerScrAsnVal£trValAlaRlBAKpGlyAlaGlyLy»Arg 
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2341 GAGC7CATMCATCA3GCICCTCCAACG70rCAGTC6CCCACGACGGCGCTGSMAGArc 
CTCGAGTATTGIAGTACGAGCAC^TGCACASTCAG 

valTyrT7rLeuThrArgAjpProThrThr?rc>LeuAlaArc/wL^5TrpGluThrXla 
*401 ^TCrACTACCTCACCXCTGACCCTACJJ^CCCCCCTCGCGJ^ 

ArcHl«ThrProValAinSerXrpI-euClyXsnIi«IleMatPh«AlAProThrLBuTrp 
8461 AGACACACTCCAGTCAATICCTCGCTAGGCAACAfAATCATGTT^ 
?CTCIGTGA5G1£AGTTAAGGACCGAra^ 

AlaArgMetlleLeutotThrHlaPhePheSerVal^ 
8521 GCGAGGATGATACTGATGACCCAXTrCTTTACCGTCCTTAtAGCCACGGACCACCTTCAA 

CGCICC^ACTATCttCUCTCGGIAAAGAro 

ClnAl&l^uA«pCy5GluIl«17rCl7AlaCysT7rSerILeCluProLeuA«pl^uPro 
B581 CAGGCCCTCGATTGCGAGATCTACGGC^^TGCTACTCCATAGAAC 

GTCCCGGAGCTAAC(X!TCTAGAIGCCCCG<^CGATGAGGTATCTTCCTGAACTAGATGCA 
i 

ProlleIleGlnArglAuJttsGlyl*uSiar^^ 
B641 CCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACACTTACTCTCCAGGT 
GGTTAGIAAG I rTCTC AgGTACCGSAGTCCCGTAAAAGTGAGGTGTCAATGAGAGGTCCA 

GluileAsr^gVammaCyai^uArgLysr^uGlyvalPrcProLeuArgAlaTr? 
8101 GAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGTACCGCCCTTGCGAGC 

CTTTAATTAICCCACCGGCGTACGGaGTCTTTTGAACCCCATGGCGGGAACGCTCGAACC 



ArgHi sArg AlaAr gSerValArgAl aArgl^uI^uAl aArgGlyG lyArgAl aAlalle 
6761 AGACACCGGGCCCCGAGCGTCCGCGCTAGGCTTCTGGCCAGAGGAGGCAGGGCTGCCATA 
TCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCCGTCTCCTCCGTCCCGACGGTAT 

CyBGlyLyfiTyrLeuPheAsnTrpAlaValAraThrLysLauLy«L«uThrPre>IleAla 
8821 TCTGGCAAGTACCTCT rCAACTCCGCAGTAACAACAAACCTCAAACTCACTCCAATAGCG 
ACACCCnCATGGAGAAGTTCACCCGTCATT ClTClTO CGAGTTTC 

AlaMaClyClnl^uAfipl^uSexClyTrpPhe^ 
8881 GCCGC?GGCCAGCTGGACTTGTCXGGCTGGTTCACGGCTGGCT 

CGGCGACCGGTCGACC1CAACAGGCCGACCAAGTGCCGACCGA1GTCGCCCCCTCTGTAA 

(Pro) 

TyrHlsSerValSerHlsAlaArgProArgTrplleTrpPheCyaLe u L e uLeuLeuAla 
394 1 TATCACA6CGTGTCTCAIGCCCGGCCCCGCTGGATCTOGTTTTGCCTACTCCTGCTTGCT 
ATAGTGTCGCACAGAG7AGGGGCCGGGGCGACC2AGACCAAAACGGA7GAGGACGAACGA 

AlaGlvValGlylleTyrLeuIeuProAsaArgOP 
9001 GCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTTGGGGTAAACACXCCGGCCT 
CGTCCCCATCCGTAGATGGAGGAGGGGTTGGCTACTTCCAACCCCATTTGTGAGGCCGGA 
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